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ABSTRACT 

Nucleotide excision repair (NER) is a major repair 
pathway that recognizes and corrects various 
lesions in cellular DNA. We hypothesize that 
damage recognition is an initial step in NER that 
senses conformational anomalies in the DNA 
caused by lesions. We prepared three DNA 
duplexes containing the carcinogen adduct N-(2 - 
deoxyguanosin-8-yl)-7-fluoro-2-acetylaminofluorene 
(FAAF) at d, G 2 or G 3 of Nar\ sequence (5-CCG^G 2 
CG 3 CC-3). Our 19 F-NMR/ICD results showed that 
FAAF at and G 3 prefer syn S- and W-conformers, 
whereas anti B-conformer was predominant for G 2 . 
We found that the repair of FAAF occurs in a 
conformation-specific manner, i.e. the highly S/ 
W-conformeric G 3 and -G^ duplexes incised more 
efficiently than the B-type G 2 duplex (Ga^Gi > G 2 ). 
The melting and thermodynamic data indicate that 
the S- and W-conformers produce greater DNA dis- 
tortion and thermodynamic destabilization. The N- 
deacetylated N-(2 -deoxyguanosin-8-yl)-7-fluoro-2- 
aminofluorene (FAF) adducts in the same Nar\ 
sequence are repaired 2- to 3-fold less than FAAF: 
however, the incision efficiency was in order of 
G 2 ~G 1 > G 3 , a reverse trend of the FAAF case. We 
have envisioned the so-called A/-acetyl factor as it 
could raise conformational barriers of FAAF versus 
FAF. The present results provide valuable conform- 
ational insight into the sequence-dependent 



UvrABC incisions of the bulky aminofluorene DNA 
adducts. 



INTRODUCTION 

Adduct formation is an important aspect of DNA 
damage: if unrepaired, various mutations in DNA could 
occur (1-3). The presence of mutations on specific onco- 
genes or tumor suppressor genes may trigger cancer initi- 
ation. Human cells are armed with various effective repair 
pathways to safeguard genomic DNA from continuous 
assault by exogenous and endogenous sources (4). 
Nucleotide excision repair (NER) is a major repair 
pathway that is known for the removal of stretches of 
bases containing various lesions, including single-base 
damages, bulky adducts and cross-links, among others 
(5). Deficiencies in NER are closely associated with the 
development of several genetic diseases, such as xero- 
derma pigmentosum that increases the risk of skin 
cancer due to higher sensitivity to sunlight (6). 

The NER pathway in Escherichia coli involves the 
UvrABC nuclease system and has been studied extensively 
for understanding DNA damage recognition and incision. 
E. coli NER is initiated following damage recognition by a 
dimeric UvrA protein. Next, UvrB protein reaches the 
damage site, forms a trimer and verifies the damage. 
Departure of UvrA from the resulting complex recruits 
UvrC and UvrD proteins, which cleave and remove the 
lesion-bearing patch of DNA. Finally, DNA polymerase I 
synthesizes and ligase I seals a new patch to complete the 
repair process (5,7). 
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Arylamines are an important class of environmental 
pollutants that are implicated in the etiology of human 
cancers, especially of the bladder and liver (1). 
2-Acetylaminofluorene was originally developed as an 
agricultural insecticide, but was later banned due to its 
strong tumorigenic activity in rat livers (8). It has been 
used extensively as a model for studying chemical carcino- 
genesis. In vivo, metabolic activation of AAF produces a 
highly electrophilic nitrenium ion, which subsequently 
interacts with DNA to produce two major C8-substituted 
dG adducts: AAF and AF (Figure la) (8,9). In vitro, 
7V-acetylated AAF blocks the activity of high-fidelity poly- 
merases and requires bypass polymerases for a translesion 
synthesis (TLS), whereas AF only slows down replication 
(10). In general, the bulky AAF exhibits greater suscepti- 
bility towards NER than AF (11), which is known to 
exist in a sequence-dependent equilibrium between 
anti B-conformer and syn S-conformer (Figure lc) 
(10,12-14). We recently reported that AAF adducts also 
adopt a sequence-dependent S/B/W-conformational equi- 
librium (Figure Id) (15). 

Local sequence context plays an important role in the 
repair of arylamine-DNA adducts (16). Fuchs et al. con- 
structed DNA sequences modified with AAF at each of 
three guanines of the most frequently studied mutational 
hotspot known as Narl sequence (5 ; - . . . CGiG 2 CG 3 
CC . . . -3') and tested their substrate repairability in the 
E. coli UvrABC and human exonuclease systems. In 
E. coli, the three AAFs were repaired in a sequence- 
dependent manner, with relative repair efficiencies of 
Gi:G 2 :G 3 in a ratio of 100:18:66 (17,18). However, differ- 
ent repair efficiencies were observed for the same lesions 



by the human exonuclease, 38:100:68 for Gi, G 2 and G 3 , 
respectively (18). AAF at G 3 of Narl sequence induces 
~ 100-fold greater frequency of —2 frameshift (—2 
deletion) mutations, even though the three guanines 
exhibit similar chemical reactivities (19). We have shown 
that the FAF-modified Narl -2 deletion duplex in the 5 - 
CGiG 2 C G 3 *CC -3 / context adopts a single looped-out 
bulge structure, whereas the 5 / -CGiG 2 C G 3 *CT- 3 / 
context results in a local conformational heterogeneity 
(20). These results support the importance of the 3'-next 
flanking nucleotide to the lesion in modulation of 
mutation efficiency. The studies verified that the conform- 
ational stability of a slipped mutagenic intermediate is a 
critical determinant for the hotness (up to 30- to 50-fold) 
of G 3 in Narl sequence for —2 frameshift mutation (20- 
22). Mekhovich et al. (23) found a greater incision rate in 
E. coli systems when AAF was located at G 3 of the Narl 
sequence (5'— CG 1 G 2 CG 3 *CC— 3 ) than in a non-Narl 
sequence (5' — GATG*ATA — 3'). Zou et al. (24) have 
reported that the UvrABC incision efficiency is 70% 
more in the TG*T than in the CG*C sequence context 
when adducted with either AF- or AAF lesions. 

The NER pathway is characterized by its unique ability 
to excise a wide array of structurally diverse DNA lesions. 
The structure of individual adducts per se is not as import- 
ant as lesion-induced local distortions and destabilizations 
to trigger a NER response. Examples include disruption of 
Watson-Crick hydrogen bonding, DNA bending, thermo- 
dynamic destabilization, local conformational flexibility 
and flipped-out bases in the unmodified complementary 
strand (25,26). However, mechanisms of sequence depend- 
ence that control NER efficiencies remained elusive. 




AAF: X=H / R=COCH 3 
FAAF: X=F, R= COCH 3 
AF: X=H, R=H 
FAF: X=F, R= H 



(b) 



5 ' -CTCTCG 1 G 2 CG 3 CCATCAC-3 ' 
3 ' -GAGAGC C GC GGTAGTG-5 ' 

5 ' -CTCG 1 G 2 CG 3 CCATC-3 ' 
3'-GAGC C GC GGTAG-5 ' 



G= FAAF or FAF 



Atari 16-mer 



Narl 12-mer 




Figure 1. (a) Chemical structures of AAF, FAAF, AF and FAF adducts; (b) sequences of fully paired 16-mer and 12-mer Narl duplexes used in the 
present study; major groove views of the central trimer segments of (c) the B/S and (d) B/S/W-conformer equilibrium of FAF and FAAF-modified 
duplexes. The modified dG and the complementary dC are shown in red and green sticks, respectively, and the aminofluorene moiety is highlighted 
with shiny gray CPK and the 7V-acetyl with pink CPK. In the B-type conformer, anti-[FAAF/FAF]dG maintains Watson-Crick hydrogen bonds, 
thereby placing the carcinogen moiety in the major groove. The carcinogens in the S- and W-conformers stack into the helix or wedged into the 
minor groove, respectively, with the modified dG in the syn conformation. 
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It could be that the observed local sequence effects 
described in the previous paragraph for AF and AAF 
are due to differences in the extent of distortions, which 
in turn depends on the conformation adopted in a particu- 
lar sequence context. 

In the present work, we investigated the role of con- 
formational heterogeneity in the structure-repair relation- 
ships of AF and AAF. These two adducts are structurally 
similar, but differ in the absence and presence, respective- 
ly, of an 7V-acetyl group on the central nitrogen. We 
prepared oligonucleotides that were site-specifically 
modified by the fluorine model FAF and FAAF at three 
different guanines (Gi, G 2 and G 3 ) of the Narl recognition 
sequences (Figure la and b). We conducted spectroscopic 
and melting experiments for conformational and thermo- 
dynamic analyses. Moreover, we performed NER studies 
of these adducts using the E. coli UvrABC system. The 
results present strong structural and thermodynamic evi- 
dences for the differential NER efficiencies exhibited by 
AF and AAF at different guanine residues of the Narl 
sequence. 



MATERIALS AND METHODS 

Caution 

2-Aminofluorene derivatives are mutagens and suspected 
human carcinogens and therefore must be handled with 
caution. 

Crude oligodeoxynucleotides (ODN, lOjimol scale) in 
desalted form were purchased from Eurofins MWG 
operon (Huntsville, AL, USA). All HPLC solvents were 
purchased from Fisher Inc. (Pittsburgh, PA, USA). 

Preparation and characterization of FAF- and FAAF- 
modified ODNs 

We previously reported the preparation of 12-mer Narl 
ODNs (5 / -CTCG 1 G 2 CG 3 CCATC-3 / ), in which each of 
the three guanine were site-specifically modified by FAF 
(20). We have also demonstrated that the incorporation of 



fluorine atom at the longest axis position 7 does not affect 
the overall conformational and thermal/thermodynamic 
profiles of AF- or AAF-modified duplexes (14,20). The 
three FAF-modified Narl sequences were each annealed 
with a complementary 12-mer sequence (5 -GATGGCGC 
CGAG-30 to form fully paired Narl-G x -¥K¥, Narl-G 2 - 
FAF and 7VtfrI-G 3 -FAF duplexes, respectively. These 
duplexes were thoroughly characterized by 19 F-NMR, 
CD and UV melting experiments (20). 

FAAF-modified 16-mer ODN were prepared using 
the general procedures described previously (15,27). 
Briefly, approximately 0.5-1 mg of N-acetoxy-N-2- 
(acetylamino)-7-fluorofluorene dissolved in absolute 
ethanol was added drop wise to a sodium citrate buffer 
(pH 6.0) containing 200-250 ODs of unmodified ODN 
(5 / -CTCTCG 1 G 2 CG 3 CCATCAC-3 / ) and placed in a 
shaker for 5min at 37°C. Figure 2a shows a typical 
reversed-phase HPLC chromatogram derived from the re- 
sulting mixture. The FAAF modified oligomers appearing 
between 28 and 85min were separated and purified up to 
>97% purity by repeated injections. The HPLC system 
consisted of a Hitachi EZChrom Elite HPLC unit with 
an L2450 diode array detector and a Phenomenex Luna 
C18 column (150 x10mm, 5.0 urn). We employed a 
gradient system involving 3-15% acetonitrile for 40min 
followed by 15-20% and 20-35% acetonitrile for 20 and 
40min, respectively, in pH 7.0 ammonium acetate buffer 
(100 mM) with a flow rate of 2.0ml/min. 

The three FAAF-modified 16-mer sequences were each 
annealed with the complementary sequence (5 ; -GTGATG 
GCGCCGAGAG-30 to form fully paired Narl-G x - 
FAAF, 7V<2rI-G 2 -FAAF and AfarI-G 3 -FAAF duplexes 
for UV melting, DSC, CD and dynamic 19 F-NMR 
experiments. 

LC/MS characterization of FAAF-modified ODNs 

Electrospray ionization and quadrupole time-of-flight 
mass spectrometry was utilized to verify the molecular 
weights and the position of FAAF attachment of the 




40 50 
Minutes 

Figure 2. (a) Chromatogram of a reaction mixture between 16-mer Narl sequence (5 / -CTCTCGiG 2 CG 3 CCATCAC-3 / ) and an activated FAAF (N- 
acetoxy-A^-2-(acetylamino)-7-fluorofluorene). The mono- (Gi, G 3 , G 2 ), di- and tri-FAAF adducts eluted in the 28-35, 42-60 and 84min were purified 
by reversed-phase HPLC (see 'Materials and Methods' section for gradient condition); (b) online photodiode array UV/Vis spectra of mono-, di- and 
tri-FAAF adducts. 



3942 Nucleic Acids Research, 2012, Vol. 40, No. 9 



three oligomers. The 16-mer ODNs were sequenced using 
3 / -5 / or 5 f ~y exonucleases as described previously for the 
analysis of modified 12-mers (28). Normally, 1 jig of a 
particular ODN was combined with 0.01 units of an exo- 
nuclease in a 1 mM solution of MgCl 2 and incubated for 
several hours. The digests were separated using a 
Phenomenex Aqua CI 8, 1.0 x 50 mm column (5 um; 120 
A). Solvent A was 5mM in both ammonium acetate and 
dimethylbutyl amine. Acetic acid was added to solvent A 
to adjust the pH to 7.0. Solvent B was 0.1% formic acid in 
acetonitrile. The flow rate was 100(il/min and total run 
time was 20min. All LC/MS spectra were acquired using a 
Waters SYNAPT quadrupole time-of-flight mass spec- 
trometer (Milford, MA, USA) operated in the negative 
ion and V-modes. The measured molecular masses of all 
three isomeric ODNs were within 0.1 Da of their theoret- 
ical monoisotopic mass (5016.9 Da). 

UV melting 

UV melting data were obtained using a Caryl 00 Bio UV/ 
VIS spectrophotometer equipped with a 6 x 6 multi-cell 
block and 1.0 cm path length. Sample cell temperatures 
were controlled by an in-built Peltier temperature control- 
ler. Oligonucleotide duplexes with a concentration range 
of 0.4-6.4 uM were prepared in solutions containing 0.2 M 
NaCl, 10 mM sodium phosphate and 0.2 mM EDTA at 
pH 7.0. Thermomelting curves were constructed by 
varying temperature of the sample cell (l°C/min) and 
monitoring absorbance at 260 nm. A typical melting ex- 
periment consisted of forward/reverse scans and was 
repeated five times. Thermodynamic parameters were 
calculated using the program MELT WIN version 3.5 as 
described previously (12). 

Circular dichroism 

Circular dichroism (CD) measurements were conducted 
on a Jasco J-810 spectropolarimeter equipped with a 
Peltier temperature controller. Typically, 2 ODs of each 
strand were annealed with an equimolar amount of a com- 
plementary sequence. The samples were dissolved in 400 ul 
of a neutral buffer (0.2 M NaCl, lOmM sodium phos- 
phate, 0.2 mM EDTA) and placed in a 1.0 mm path 
length cell. The samples were heated at 85°C for 5min 
and then cooled to 15°C, over a lOmin period to ensure 
complete duplex formation. Spectra were acquired every 
0.2 nm with a 2 s response time from 200 to 400 nm at a 
rate of 50nm/min, were the averages of 10 accumulations 
and were smoothed using 17-point adaptive smoothing 
algorithms provided by Jasco. 

Dynamic 19 F-NMR 

Approximately 20 ODs of a pure FAAF-modified 16-mer 
ODN was annealed with an equimolar amount of a com- 
plementary sequence to produce a fully paired duplex 
(Figure lb). The samples were then dissolved in 300 ul of 
typical pH 7.0 NMR buffer containing 10% D 2 O/90% 
H 2 0, lOOmM NaCl, lOmM sodium phosphate and 
100 uM EDTA, and filtered into a Shigemi tube through 
a 0.2 um membrane filter. All ! H- and 19 F-NMR results 
were recorded using a dedicated 5 mm l9 F/ l U dual probe 



on a Bruker DPX400 Avance spectrometer operating 
at 400.0 and 376.5 MHz, respectively, using acquisition 
parameters described previously (14, 20, 29). Imino 
proton spectra at 5°C were obtained using a phase- 
sensitive jump-return sequence and referenced relative to 
that of DSS. 19 F-NMR spectra were acquired in the 
^-decoupled mode and referenced relative to that 
of CFC1 3 by assigning external C 6 F 6 in C 6 D 6 at 
— 164.9 ppm. One and two-dimensional 19 F-NMR 
spectra were measured between 5 and 60° C with increment 
of 5-1 0°C. Temperatures were maintained by a Bruker- 
VT unit with the aid of controlled boiling liquid N 2 in 
the probe. Line shape simulations were performed 
as described previously (30) using WINDNMR-Pro 
(version 7.1.6; J. Chem. Educ. Software Series; Reich, H. 
J., University of Wisconsin, Madison, WI, USA). 

DSC experiments 

Calorimetric measurements of the three FAAF-modified 
16-mer duplexes were performed using a Nano-DSC from 
TA Instruments (Lindon, UT, USA). Prior to temperature 
scanning, samples were degassed for at least lOmin under 
house vacuum in a closed vessel. Solutions were loaded, 
respectively, into the sample and reference cells using a 
pipette by attaching a small piece of silicone tube at the 
end of the tip and were purged several times to get rid of 
air bubbles. After both cells were filled, they were capped 
and a slight external pressure (~3atm) was applied to 
prevent evaporation of the sample solution. Raw data 
were collected as microwatts versus temperature. 
Template-primer solutions were prepared by dissolving 
desalted samples in a pH 7.0 buffer solution consisting 
of 20 mM sodium phosphate and 0.1 M NaCl. In a 
typical scan, a 0.1 mM template-primer solution was 
scanned against buffer from 15°C to 90° C at a rate of 
0.75°C/min. At least five repetitions were obtained. A 
buffer vs. buffer scan was used as a control and subtracted 
from the sample scan and normalized for heating rate. 
This results in base-corrected AC/ X versus temperature 
curves. Each transition shows negligible changes in the 
heat capacities between the initial and final states, thus 
AAC p ex was assumed to be zero. The area of the resulting 
curve is proportional to the transition heat, which, when 
normalized for the number of moles of the sample, is equal 
to the transition enthalpy, AH. AH is an integration of 
AC/ X over temperature T. All sample solutions were 
0.1 mM concentration. T m was the temperature at half 
the peak area. AG and AS values have been determined 
according to the procedures described by Chakrabarti 
et al. (31). 

Substrate construction and UvrABC protein purification 

DNA substrates of 55 bp containing a FAAF adduct at 
each of three guanine residues were constructed as previ- 
ously described (11,32). Briefly, an FAAF-modified 
16-mer ODN (5 / -CTCTCG 1 G 2 CG 3 CCATCAC-3 / ) was 
ligated with flanking 20-mer ODN ^-GACTACGTACT 
GTTACGGCT-30 and 19-mer ODN (5 -GCAATCAGG 
CCAGATCTGC-30 ODN at the 5'- and 3'-end, respect- 
ively (Supplementary Figure SI). The 20-mer was 
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5 / -terminally labeled with P. The ligation product was 
purified by urea-PAGE under denaturing conditions. 
Following the purification, the substrate was annealed 
to the corresponding complementary strand, and then 
purified on an 8% native polyacrylamide gel. Similar 
procedures were employed to construct FAF- 
modified DNA substrates using 12-mer Narl sequence 
(5 / -CTCG 1 G 2 CG 3 CCATC-3 / ), which we prepared previ- 
ously for 19 F-NMR/UV/ICD studies (20). 

UvrA, UvrB and UvrC proteins were over expressed in 
E. coli and then purified as previously described (33). The 
estimated purity of the three proteins was >95%. A 
Bio-Rad Protein Assay was used to determine the 
protein concentration with BSA as the standard based 
on the manufacturer-recommended procedures. 

Nucleotide excision assay and quantification of incision 
products 

The S'-terminally labeled DNA substrates were incised by 
UvrABC as previously described (11,32). Briefly, the 
DNA substrates (2nM) were incubated in the UvrABC 
reaction buffer (50 mM Tris-HCl, pH 7.5, 50 mM KC1, 
10 mM MgCl 2 , 5mM DTT) at 37°C in the presence of 
UvrABC (UvrA, 10 nM; UvrB, 250 nM; and UvrC, 
100 nM). The Uvr proteins were diluted and premixed in 
Uvr storage buffer before addition to the reaction. 
Aliquots were collected at 0, 5, 10, 15 and 20min into 
the reaction. The reaction was terminated by heating at 
95°C for 5min. The products were denatured by addition 
of formamide loading buffer and heating to 95°C for 
5min, followed by quick chilling on ice. The incision 
products were then analyzed by electrophoresis on a 
12% polyacrylamide sequencing gel under denaturing 
conditions with TBE buffer. 

To quantify the incision products, radioactivity was 
measured using a Fuji FLA-5000 Image Scanner with 
MultiGauge V3.0 software. The DNA incised (in fmol) 
by UvrABC was calculated based on the total molar 
amount of DNA used in each reaction and the ratio of 
the radioactivity of incision products to total radioactivity 
of DNA. At least three independent experiments were per- 
formed for determination of the rates of incision. 



RESULTS 

Model sequences 

We previously used FAF-modified 12-mer duplexes 
(5 / -CTCG 1 G 2 CG 3 *C7VATC-3 / , TV = C or T) to probe the 
impact of flanking and 3'-next flanking sequences on 
Afarl-induced frameshift mutagenesis (20). Initially, we 
tried to use the same 12-mer Narl sequence for 
FAAF-modification for sake of comparison and consist- 
ency; however, the sequence was unsuitable for FAAF. 
Although FAAF adduction on the 12-mer Narl 
sequence was facile, a resulting reaction mixture was dif- 
ficult to purify on the reverse phase HPLC system (see 
asterisked peaks in Supplementary Figure S2). 
Moreover, ligation efficiencies of the FAAF-modified 
12-mers, particularly on the G r and G 2 -positions, were 
very low. Accordingly, the length of DNA was increased 



to 16 (5 / -CTCTCGGCGCCATCAC-3 / , Figure lb) by 
adding two nucleotides (underlined) on either side of the 
12-mer. The resulting FAAF-modified 16-mer ODNs were 
separated well on a reverse-phase HPLC and exhibited 
excellent ligation efficiencies (see below). 

Figure 2a shows an HPLC profile of a work-up mixture 
after 20min of reaction. Unreacted control ODN 
appeared at 21 min, followed by seven FAAF-modified 
ODNs in three retention time zones: Peaks 1-3 at 28- 
35 min, Peaks 4-6 at 42-60 min and Peak 7 at 84 min. 
Online UV (Figure 2b) of the modified ODNs displayed 
a small shoulder in the 290-320 nm range. The relative 
absorption intensities (290-320 nm) for the three peak 
groups were approximately 1:2:3. This finding is reminis- 
cent of AF- or FAF-induced absorption shoulders 
observed in 290-350 nm, whose intensities correlate con- 
sistently with the number of adduct modifications (28). As 
a result, Peaks 1-3 and 4-6 were assigned as mono- and 
di-adducts, respectively, and Peak 7 as a tri-adduct. These 
adducts were characterized by exonuclease digestion/ 
ESI-TOF-MS-MS analyses, as described below. 
Depending on the location of FAAF, the mono-adducts 
were designated as Narl-Gi, Narl-G 2 or Narl-G 3 , in 
which Gi, G 2 and G 3 signify the position of the 
FAAF-modified guanine. Details of the structural charac- 
terization and repair of the di- and tri-FAAF adducts will 
be published separately. 

The HPLC elution profile of the FAAF-modified 
16-mer ODNs in the present study is similar to that 
of the AAF-modified 15-mer Narl sequence 
(5 / -TCCTCG i G 2 CGaCCTCTC-3 / ) reported by Tan et al 
(34). These results indicate that AAF and FAAF are 
chromatographically comparable, irrespective of 
sequence length, as long as the common Narl core 
(underlined) is included in the sequences. This is not 
surprising since conformational and thermodynamic 
compatibilities of fluorine containing AF and AAF 
models have well been documented (20,30,35). A similar 
elution pattern was observed for the FAF-modified 
12-mer and FAAF-modified 16-mer Narl sequences; 
however, the order of elution of Gi and G 3 was reversed 
(compare Figure 2a with Figure 2 in Ref. 20). 

ESI-QTOF-MS characterization 

The molecular weights of all three FAAF-modified ODNs 
were measured by ESI-QTOF-MS prior to sequence veri- 
fication by exonuclease digestion. Ionization of ODNs 
normally occurs by the loss of a proton from a phosphate 
group in the ODN backbone. As the number of nucleo- 
tides in an ODN increases, the average charge state 
observed in the full scan mass spectra increases as well 
(28). As shown in Supplementary Figure S3, the ODNs 
containing 16 nucleotides form (M-4H) 4- ions predomin- 
antly unlike the 12-mers studied previously that form 
(M-3 H) 3_ primarily upon electrospray (28). Exonucleases 
cleave terminal deoxynucleotides from the ODN chain 
until the FAAF-modified nucleotide is exposed at the 
end of the chain. At that point the digestion reaction 
slows down significantly. The position of modification is 
identified (in this case) when the fragment(s) formed by 
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the loss of the unmodified guanine nucleotides is observed 
in the LC/MS spectra. This is shown in Supplementary 
Figure S4 for the 3' digest of the -Gi(FAAF)G 2 CG 3 - 
ODN. The ions observed at m/z 659.5 and m/z 989.7 are 
the (M-3 H) 3 - and (M-2H) 2 " ions formed from the 5'-CT 
CTCG^FAAF)^ ODN digest fragment. The observation 
of these ions confirms that this ODN is modified on the G 
closest to the 5' end. The LC/MS analysis of the 5' digest 
of the second modified ODN to elute is shown in 
Supplementary Figure S5a and 5b. The exonuclease diges- 
tion of this particular reaction product was particularly 
slow and evidence for endonuclease activity is observed 
in the mass spectra. All the Y fragments observed in 
Supplementary Figure S5 are (M-3H) 3- ions. The Y10 
and Y9 fragments at m/z 1069.2 (5 / -G 2 CG 3 (FAAF)CCA 
TCAC-3) and m/z 959.5 (5 / -CG 3 (FAAF)CCATCAC-3 / ) 
are formed by consecutive cleavages of unmodified 
guanines, confirming that this reaction product is 
modified on the guanine closest to the 3' end. LC/MS 
analysis of the exonuclease digests derived from the 
third singly-modified ODN to elute indicates that the 
FAAF group is attached to the central G in the 
sequence, -G 1 G 2 (FAAF)CG 3 -. LC/MS analysis of the 3' 
digest (Supplementary Figure S6) show ions at m/z 1154.7 
and m/z 769.5 corresponding to the (M-2H) 2- and (M- 
3 H) 3 ~ ions derived from the 5 / -CTCTCG 1 G 2 (FAAF)-3 / 
digest fragment. The mass spectra acquired from the 5'- 
digest (Supplementary Figure S7) show (M-2H) 2- and 
(M-3H) 3 " ions at m/z 1604.3 and m/z 1069.2 whose 
masses are consistent with 5 / -G 2 (FAAF)CG 3 CCATCAC 
-3' fragment. No ions formed by the loss of two guanine 
deoxynucleotides were observed in any of the mass 
spectra. The observation of ODN fragments with two 
G's in both 3' and 5' digests confirms that the last singly 
modified ODN to elute from the reaction mixture is 
modified on the middle G. 

Circular dichroism 

Figure 3 a shows an overlay of the CD spectra for the three 
FAAF-modified Nar\-G u -G 2 and -G 3 duplexes relative 
to the unmodified control (red). Unmodified and 
FAAF-adducted duplexes both displayed a positive and 
negative ellipticity at around 270 and 250 nm, respectively, 
which is an S-curve characteristic of a B-form DNA 



double helix. The modified duplexes displayed significant 
blue shifts relative to the unmodified duplex, Narl-G 3 
(6nm) >> Narl-Gi ~ G 2 (3nm), indicating adduct- 
induced DNA bending. A concomitant increase in the 
positive intensity around 270 nm was noted in the order 
of G 3 ~ Gi > G 2 , which could be due to the interaction of 
the intercalated S-conformeric FAAF with neighboring 
bases. We noted a similar blue shift and hyperchromic 
effect for the highly (75%) S-conformeric FAF-modified 
Nar\-G 3 duplex (Figure 3b and Table 1 in Ref. 20). 

More importantly, FAAF-modified duplexes exhibited 
sequence-dependent induced CD in the 290-320 nm range 
(ICD 290 _ 320nm ). This finding is reminiscent of ICD 290 _ 
350 nm, which has been used as a sensitive marker for the 
FAF-induced S/B/W-conformational heterogeneity 
(positive for S- and W- and negative for B-conformer) 
(12, 15, 20, 29, 36). In the present case, however, the 
FAAF-modified Narl duplexes exhibited negative dips, 
with the Nar\-G 2 duplex showing a greater dip than Gi 
or G 3 duplexes (Figure 3a). This result could be due to a 
higher ratio (57%) of B-conformer for the Nar\-G 2 
duplex. 

UV melting experiments 

Supplementary Figure S8 shows the UV melting profiles 
of the three FAAF-Afarl duplexes and an unmodified 
control duplex, all at 6.4 jiM. All duplexes showed 
typical monophasic, sigmoidal, helix-coil transitions 
with a strong linear correlation (R 2 >0.9) between T m ~ l 
and lnC t . Thermal and thermodynamic parameters 
calculated from UV melting are summarized in 
Supplementary Table SI. As expected, modified duplexes 
were destabilized thermally and thermodynamically 
relative to the control duplex. The magnitude of thermal 
(AT m ) and thermodynamic (A AG) destabilization was in 
the order of Narl-G 2 ~ Narl-G 3 (-8.7 to -8.8°C, 3.3 to 
3.7 kcal/mol, respectively) > Nar\-G x (— 4.6°C and 
2.0kcal/mol, respectively). 

Differential scanning calorimetry 

Figure 3b shows differential scanning calorimetry (DSC) 
plots of excess heat capacity C p GX versus temperature for 
the FAAF-Narl duplexes relative to the unmodified 
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Figure 3. (a) CD spectral overlays recorded at 15°C and (b) DSC curves recorded in 20 mM phosphate buffer containing 0.1 M NaCl at pH 7.0 of 
fully paired 16-mer Narl duplexes with FAAF modification at Gi (green), G 2 (blue) and G 3 (pink). 
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Table 1. Thermal and thermodynamic parameters of FAAF modified Narl duplexes obtained from differential scanning calorimetry 

y-CTCTCGiGsCGsCCATCAC-S' 
3 -GAGAGC C GCGGT AGTG- 5' 



-AH 
(kcal/mof 



-AS (eu) 



—AG 37 o c 
(kcal/mof 



?m a (°Q 



AAH h 
(kcal/mol) 



AAS C (eu) 



AAG 37 o c d 
(kcal/mof 



AT m e (°C) 



Control 
AfarI-G r FAAF f 
Narl-G 2 -¥AA¥ { 
Narl-G 3 -¥AA¥ { 



117.5 
95.6 
98.9 
92.8 



319.1 
260.2 
272.1 
254.4 



18.6 
14.9 
14.5 
13.9 



73.9 
68.6 
66.0 
65.6 



21.9 
18.6 
24.7 



58.9 
47.0 
64.7 



3.7 
4.1 
4.7 



-5.3 
-7.9 
-8.3 



a T m values is the temperature at half the peak area. 
h AAH = AH (modified duplex) —AH (control duplex). 
°AAS = AS (modified duplex) —AS (control duplex). 
d AAG = AG (modified duplex) -AG (control duplex). 
e AT m = T m (modified duplex) — T m (control duplex). 
f The average standard deviations for —AG, —AH and 



T m are ±0.4, ±3.0 and ±0.4, respectively. 



Table 2. Conformational heterogeneity (B/S/W), thermal destabiliza- 
tion and relative percent incision rates of FAAF- and FAF-modified 
Narl duplexes 



Narl Duplexes 
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rate c (%) 


7VtfrI-G r FAAF 


46 


34 


20 


-5.3 


93 


7VarI-G 2 -FAAF 


57 


15 


9 


-7.9 


32 


Narl-G 3 -¥AA¥ 


13 


61 


26 


-8.3 


100 


Narl-G i-¥A¥ 


42 


58 




-9.4 


44 


Narl-G 2 -¥A¥ 


69 


31 




-6.8 


43 


Narl-G 3 -¥A¥ 


35 


65 




-8.3 


25 



a The percent population ratios were calculated at 5°C on the basis of 
line simulations. 

b AT m = T m (modified duplex) - r m (control duplex). 

c Percent incision rate of modified duplexes with respect to Narl-G 3 - 

FAAF (100%). 



control. Table 1 summarizes the thermal and thermo- 
dynamic parameters derived from these DSC curves. 
Consistent with the UV melting data, the Narl-G 3 
duplex was most destabilized ( A AH = 24.7 kcal/mol, 
A AG 3TC = 4.7 kcal/mol, Ar m = -8.3°C), followed by 
Narl-G 2 (AAG 3TC = 4.1 kcal/mol, AT m = -7.9°C). 
Narl-G x was the least affected (AAG 37 o C = 3.7 kcal/mol, 
Ar m = -5.3°C). 

Differences in the thermal and thermodynamic destabil- 
izations must have arisen from the differences in the 
S/B/W-conformational characteristics. The most 
S-conformeric (61%) Narl-G 3 duplex causes disturbance 
of Watson-Crick base pairing, resulting in enthalpy reduc- 
tion (A AH = 24.7 kcal/mol) (Table 2). However, the large 
entropy (AAS = 64.7 eu) compensates for the enthalpy, 
thus resulting in the overall free energy loss of 
AAG 37 o C = 4.7 kcal/mol (37). In contrast, Narl-G x and 
-G2 duplexes possess higher populations of B-conformer 
(46 and 57%, respectively), thus exhibiting lower differ- 
ences in the enthalpy values (A AH = 21.9 kcal/mol and 
A AH = 18.6 kcal/mol, respectively) (Table 2). The rela- 
tively small enthalpy differences observed for the Gi and 
G 2 duplexes could be attributed to the presence of S- and 



W-conformers in addition to B-conformer. As expected, 
entropy compensation was less in these two duplexes 
(Narl-G x , A AS = 58.9 eu and Narl-G 2 , A AS = 47.0 
eu), yielding similar overall free energies (Narl-G 1, 
A AG = 3.7 kcal/mol and Narl-G 2 , A AG = 4.1 kcal/mol). 

S/B/W conformational heterogeneity 

Figure 4a shows the 19 F-NMR spectra of FAAF-Narl 
16-mer G r , G 2 - and G 3 -duplexes measured at 5°C, in 
which 19 F signals are in slow chemical exchange. These 
Afarl-FAAF duplexes exhibited three to five 19 F signals, 
each representing a particular conformation. The percent 
population ratios shown were calculated on the basis of 
line simulations as shown in Supplementary Figure S9. 
Assignments of the different 19 F signals of each duplex 
were necessary to carry out meaningful structure- 
activity-relationship studies. The signal assignments in 
Figure 4a were made initially on the basis of chemical 
exchange, ring current effect and chemical shift pattern 
recognition as have been done for a number of FAAF- 
and FAF-adducts in various sequence contexts (30,35). It 
has been demonstrated that AF and AAF adducts adopt 
the S/B- and S/B/W-conformational equilibrium, respect- 
ively (Figure lc and d) and their 19 F chemical shifts are 
independent of overall sequence and its length, but 
strongly rely on the nature of the bases flanking the 
lesion (15,30). The major 19 F signals in Figure 4a correlate 
well with the S/B/W-profiles reported previously for 
FAAF adducts (15), i.e. B-, S- and W-conformers going 
from downfield to upfield, i.e. -115.0 to -115.5, -115.5 
to -117.0 and -117.0 to -118.0ppm, respectively. Addi- 
tional signals were observed in the — 1 14. 0~— 1 15.0ppm 
range for the Narl-G 2 and -G 3 duplexes (Figure 4a, see 
Supplementary Table S2 for exact chemical shifts). 
Consistent with this observation, their proton spectra dis- 
played a mixture of broad imino signals arising not only 
from those involved in Watson-Crick hydrogen bonds 
(12-14 ppm), but also from the lesion site and its vicinity 
(ll-12ppm) (Supplementary Figure S10). 

To complement the 19 F signal assignments, we addition- 
ally conducted a set of comparative spectral analyses using 
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Figure 4. 19 F-NMR spectra of (a) FAAF-modified Narl 16-mer, (b) FAAF-modified non-Narl 12-mer and (c) FAF-modified Narl 12-mer duplexes 
at 5°C. *unknown conformers; # impurity. 



three FAAF- 12-mer duplexes in the non-Narl sequences 
(Figure 4b) in otherwise identical flanking sequence 
contexts (CG*G, GG*C and CG*C context for G 1? G 2 
and G 3 , respectively). The top trace in Figure 4b is the 
19 F-NMR spectrum of a FAAF-modified 12-mer duplex 
(5 / -CTTCT CG*C CCTC-3 / ), whose S/B/W conformation- 
al profiles have been well characterized (15). It should be 
noted that this non-Narl 12-mer duplex contains the iden- 
tical CG*C flanking sequence context as the 16-mer Narl- 
G 3 -FAAF duplex. Comparison of the two spectra (i.e. top 
traces of Figure 4a and b) revealed a parallel trend both in 
terms of chemical shifts and population ratios (Table 2 
and Supplementary Table S2), supporting the conform- 
ational assignments. This is consistent with our previous 
findings that the electronic environment for the 19 F signals 
of AF and AAF adducts are strongly modulated by the 
nature of flanking bases (15, 30). Similarly, we prepared 
two additional FAAF-modified non-Narl 12-mer duplexes 
(5 / -CTTCT CG*G CCTC-3 / and y-CTTCTC 

GG*C CTC-3Q with the same flanking base contexts 
(underlined) as the Nar\-G x and -G 2 duplexes, respective- 
ly. Figure 4a and b compares the 19 F-NMR spectra of all 
three Narl 16-mer and non-Narl 12-mer duplexes side by 
side. The 19 F signal profiles, as indicated by dotted lines 
(pink, B; red, S; green, W) for Gi^G 2 ^G 3 of each 
sequence context, match quite well overall despite of 
slight variations observed in chemical shifts and popula- 
tion ratios, particularly for the GG*C sequence context. 
Whereas the S- and W-conformer signals were prone to 
shift, the B-conformer signal appeared to be steady at 
— 115.5 ppm. This trend is more apparent in Supplemental 
Figure SI 1, in which the two FAAF-modified se- 
quence series (Narl- 1 6-mer versus non-Narl- 12-mer) are 
compared in a pair for each -CG*G-, -GG*C- and 



-CG*C- sequence contexts. It is plausible that the carcino- 
gen moiety in the major groove of the B-conformer is not 
subjected to the ring current effect, as the S- and 
W-conformers would be (14). We were unable to 
identify the minor signals (asterisked) in the 16-mer 
Narl-G 2 - (<19%) and -G 3 -FAAF duplexes, although 
their downfield shifts relative to the B-conformer imply 
B-like conformers, in which the fluorine containing car- 
cinogen moiety is exposed. 

Figure 4c shows the 19 F-NMR spectra of FAF-modified 
12-mer duplexes with the same Narl sequence contexts, 
which have been thoroughly characterized (20). The B/S 
conformer population ratios were determined to be 
42%:58%, 69%:31%, 35%:65% for FAF-modified 
Afarl-Gi, -G 2 and -G 3 , respectively, at 5°C (Table 2) 
(20). Although the chemical shift difference 
(0.4-1.0 ppm) for the B and S conformer of FAAF 
(Figure 4a) is significantly smaller than that (~ 1.5 ppm) 
of the FAF counterparts, their overall S/B ratios appear to 
match (Figure 4a and c). The B/S/W population ratios for 
the FAAF-Afarl-Gi, -G 2 and -G 3 16-mer duplexes were 
46:34:20, 57:15:9 and 13:61:26, respectively (Table 2). In 
both the FAAF- (Figure 4a) and FAF- (Figure 4c) Narl 
duplexes, the population of S-conformer decreased in the 
order of G 3 > Gi > G 2 and that of the B-conformer 
decreased in the reverse order, G 2 >Gi >G 3 . This 
comparative analysis was based on the assumption that 
structurally similar FAAF would experience similar 
sequence effects on their conformational profiles as 
observed by FAF in different sequence contexts of the 
Narl sequence (20). As expected, the aminofluorene- 
induced B/S-heterogeneity is strongly dependent on the 
nature of the flanking sequences, regardless of whether 
the lesion has the bulky acetyl group on the central 
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Figure 5. Dynamic 19 F-NMR spectra of fully paired 16-mer Narl duplexes. FAAF modification at (a) G b (b) G 2 and (c) G 3 . *unknown conformers; 
# impurity. 



nitrogen linking the carcinogen and the modified guanine, 
thus validating our assumption. 

Dynamic 19 F-NMR 

Figure 5 shows the 19 F-NMR spectra of the three FAAF- 
Narl duplexes as a function of temperature (5-60°C). 
Whereas the three 19 F signals in each duplex were in 
slow exchange at 5°C, the two downfield B- and 
S-signals became exchange broadened, giving rise to co- 
alescent signals at around 30, 40 and 25° C for G 1? G 2 and 
G 3 , respectively. In all cases, the merged signals coalesced 
with the upfield W-signal at around 60°C. All three Narl 
duplexes showed relatively strong off-diagonal contour 
peaks of the major signals in the exchange spectra (data 
not shown), confirming their chemical exchanges. 

UvrABC incisions of FAAF-adducts on Narl sequence 

Figure 6 shows the kinetic assay results, in which 55-mer 
FAAF-modified DNA duplex substrates were incised by 
UvrABC nuclease. These substrates were radioactively 
labeled at the 5'-end of the adducted strand. The major 
incision products can be seen as 18-mer (Narl-Gi), 19-mer 
(Narl-G 2 ) or 21-mer (Narl-G 3 ) separated on a urea- 
PAGE gel under denaturing conditions (Supplementary 
Figure SI 2). The incision occurred at the eighth phosphate 
bond 5' to the modified nucleotide, which is consistent 



with the previously reported results of UvrABC incision 
(11,32). 

Quantitative analysis of the incision indicated that the 
substrates were incised at different efficiencies, depending 
on where the damage site was located in the sequence 
(Figure 6). Specifically, the 7V-acetylated FAAF adducts 
at Narl-Gi and Narl-G 3 displayed similar rates of 
incision, whereas Narl-G 2 had a much lower rate of 
incision, G 3 (100%) >G ? (93%)>G 2 (32%) (Figure 6c, 
Table 2). For comparison, we also determined the 
UvrABC incision of FAF adducts in the same Narl 
sequence context. As shown in Figure 6b and c, the N- 
deacetylated FAF adducts in the same Narl sequence 
context were repaired 2- to 3-fold less than FAAF. 
Despite having similar B/S-conformer profiles (Figure 4 
and Table 2) the incision efficiency of FAF adducts 
at the three different sites in the Narl sequence followed 
the order of G { (44%) « G 2 (43%) > G 3 (25%), where the 
percentages were calculated relative to FAAF 7V<2rI-G 3 
(which was the most efficiently incised). 



DISCUSSION 

It is well known that DNA sequence is a major 
determining factor for repair outcomes of site-specifically 
modified bulky DNA lesions. In this study, we examined 
the conformational heterogeneity and thermodynamics of 
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Figure 6. Absolute percent incision rates of (a) FAAF and (b) FAF-Narl duplexes modified at Gi, G 2 and G 3 ; (c) percent incision rates histogram of 
FAF and FAAF at different positions relative to Narl-G 3 FAAF as 100%. 



FAAF and FAF at three different guanine positions (Gi, 
G 2 and G 3 ) of the well-known Narl recognition sequence. 
Moreover, we obtained NER data of these adducts using 
the E. coli UvrABC system. Table 2, which summarizes 
the S/B/W solution conformational heterogeneity, T m and 
NER efficiency results, presents strong evidence that the 
NER repair efficiencies of AAF and AF adducts in the 
Narl sequences are modulated by their conformational 
and thermodynamic properties. 

FAAF-induced B/S/W-conformational heterogeneity 

Our combined 19 F-NMR/ICD results show that FAAF 
adduct in a well-known mutational hotspot Narl 
sequence exist in a mixture of B/S/W conformers with 
varying populations (Figure 4a, Table 2). A greater popu- 
lation of sy/7-glycosidic S-(61%) and W-(26%) conformers 
was observed in Narl-G 3 , in which the lesion is flanked 
with C on both 5'- and 3'-ends (-CG 3 *C-). This result 
is consistent with the preferred ^^-conformation 
adopted by duplexes modified by AF (20), 2-amino-3- 
methylimidazo(4,5-y)quinolone (IQ) (38-40) and 2- 
amino- 1 -methyl-6-phenylimid-azo(4, 5-Z>)pyridine (PHIP) 

(41) , with the same -CG*C- contexts, in either Narl or 
non-Narl sequences (20,38,39,41). The mostly syn Narl- 
G 3 duplex appeared to be distorted, bent or possibly 
formed a B-Z junction, as evidenced by a significant 
blue shift and hyperchromic effect in CD (Figure 3a) 

(42) . The latter was probably due to the n-n stacking 



interaction between the intercalated aminofluorene and 
flanking base pairs. On the other hand, the Narl-G 2 
duplex (-GiG 2 *C-) exhibited largely the anti- 
B-conformer (57%) along with S- (15%), W- (9%) and 
two unidentified minor conformers (~19%). In compari- 
son to Narl-G 3 , the Narl-G 2 duplex exhibited smaller 
blue shift and hyperchromic effect (Figure 3a), suggesting 
lesser disturbance of the double helical DNA structure. 

These results indicate the heterogeneous nature of AAF 
in the Narl sequence and are consistent with a previous 
CD study that showed a major DNA distortion for AAF 
at G 3 adduct compared to Gi and G 2 (42). Similarly, 
Veaute et al. (43) conducted a DNase I footprint study 
on the Narl sequence and showed that AAF at the G 2 
position inhibits DNase I digestion of DNA at up to 
five bases in the modified strand and four bases in the 
complementary strand. In contrast, inhibition at the Gi 
and G 3 positions was extended to eight and six bases, re- 
spectively in the modified strand. 

The G 3 and G 2 duplexes are chemically isomeric, differ- 
ing only in the direction of the G:C base pair at the 
5 / -position (e.g. C:G G:C). Such a polarity swap is 
clearly responsible for the rather dramatic conformational 
shift from S- (61 to 15%) to B- (13 to 57%) and 
W-conformation (26 to 9%) (Table 2). A similar polarity 
switch at the 3'-end of the Narl-G\ duplex resulted 
in varying degrees of conformational shift in S- (61 to 
34%), B- (13 to 46%) and W-conformation (26 to 20%). 



Nucleic Acids Research, 2012, Vol. 40, No. 9 3949 



As expected, FAAF-modification at the three guanines 
of the 16-mer Narl sequence resulted in thermal (AT m = 
-5.3 to — 8.3°C) and thermodynamic (AAG 37 ° C = 3.7- 
4.7kcal/mol) destabilization relative to the unmodified 
control duplex (Table 1). The destabilizing effect of the 
FAAF modification was sequence-dependent and was 
related to the S/B/W-conformational profile. As 
summarized in Tables 1 and 2, the highly S-conformeric 
Narl-G 3 duplex (61%) promoted lesion stacking and dis- 
rupted the lesion site Watson-Crick base pairs, resulting 
in thermal (AT m = — 8.3°C) and enthalpic destabilization 
(AAH = 24.7kcal/mol). In contrast, the highly 
B-conformeric (57%) Narl-G 2 duplex exerted less 
enthalpy change (AAH = 18.6 kcal/mol). As expected, 
Narl-Gi (46% B, 34% S) produced an intermediate 
change in enthalpy (AAH = 21.9 kcal/mol). In all cases, 
however, enthalpy-entropy compensation resulted in a 
small overall difference (~1 kcal/mol) in thermodynamic 
destabilization (Table 1). A similar case could occur if the 
W-conformer was considered as a thermodynamic 
destabilizer, G 3 (26%) > Gi (20%) > G 2 (9%). We previ- 
ously studied three fully paired FAF-modified 12-mer 
duplexes in the same Narl sequence contexts (20). The 
UV melting results showed that FAF modification desta- 
bilizes the duplexes (AT m = -6.8 to -9.4°C, A AG = 4.2- 
4.6 kcal/mol) similarly. The highly S-conformeric (65%) 
Narl-G 3 duplex resulted in thermal destabilization 
(AT m = — 8.3°C), whereas the highly B-conformeric 
(69%) Narl-G 2 duplex exerted less destabilization 
(AT m = -6.8°C) (Table 2) (20). 

Conformation-specific nucleotide excision repair 

The E. coli UvrABC system displayed significant differ- 
ences in repair of the FAAF adduct at each guanine 
position (Gi, G 2 and G 3 ) of the Narl sequence. The 
Narl-G 2 duplex showed considerably lower efficiency 
than Narl-G x or 7V<2rI-G 3 , [G 3 (100%) >Gj (93%)>G 2 
(32%)] (Figure 6). It is clear from Table 2 that these NER 
results are in good agreement with the order of the 
S-conformer population [G 3 (61%)>Gi (34%) > G 2 
(15%)], but are in exactly the reverse order of the popu- 
lation of B-conformer, [G 2 (57%)>Gj (46%)>G 3 
(13%)]. This data suggest that the S-conformation is 
recognized and incised by E. coli NER dominantly over 
the B-conformation. We reported previously a similar 
conformation-specific NER results on a series of 
FAF-modified duplexes (16, 30). 

The carcinogen in the highly S-conformer Narl-G^ is 
base-displaced at the lesion site, thus resulting in a major 
disturbance in the DNA helical structure (AT m = — 8.3°C, 
AAG 37 °c = 4.7 kcal/mol) (Table 2). This finding is con- 
trasted with the 57% B-conformer 7V<2rI-G 2 duplex, 
which maintains Watson-Crick base pairs at the lesion 
site (Ar m =-7.9°C, A AG 37 o C = 4.1 kcal/mol). Similar 
correlations could be made with either the W-conformer 
alone, G 3 (26%)>Gi (20%)>G 2 (9%) or the syn- 
conformation (combined S and W), G 3 (87%) > Gi 
(54%) > G 2 (24%). 

For comparison, we also determined the UvrABC inci- 
sions of FAF-adducts in the same Narl sequence context. 



The two lesions revealed a similar B/S conformer hetero- 
geneity in the Narl sequence context (Figure 4). Therefore, 
the expectation was that FAF would show a similar NER 
profile as FAAF, i.e. S-/W-conformer promotes NER 
over B-conformer. However, the NER results revealed 
that incision efficiency was in the order of Gi « G 2 > G 3 
(Figure 6, Table 2). At first, this result appears to be in line 
with the B-conformer population. It should be noted that 
FAF is repaired consistently 2- to 3-fold less than FAAF 
(Figure 6c, Table 2). This result is a general trend reported 
in the literature, although much greater differences in 
incision efficiency between AF and AAF have been 
noted (11, 44). As a result, the difference between Gi 
and G 2 of FAF is not statistically significant (P = 0.83), 
but their difference with G 3 is significant (P < 0.0001). 

The incision differences between FAAF and FAF seem 
to suggest that, in addition to the sequence-dependent 
adduct conformation, the acetyl group in FAAF may 
play a role in DNA damage recognition by UvrABC. 
The only structural difference between FAF and FAAF 
is the absence of a bulky acetyl group on the linking 
nitrogen of the former (Figure la). It has been docu- 
mented that 7V-acetylated FAAF adducts in fully 
paired duplexes produce a mixture of complex 
S/B/W-conformers, whereas 7V-deacetylated FAF adopts 
a simple exchangeable S/B-equilibrium (15, 30). Thus, it is 
clear that the 7V-acetyl group is responsible for generating 
up to 26% W-conformer in the Narl sequence (Figure 4). 
The bulkiness of the acetyl group with its possibility for cis 
and trans rotamer transitions about the amide bond (14, 
15, 45) may facilitate the repositioning of the fluorenyl 
rings into the minor groove from the S conformation. 
This conformational rearrangement is relatively straight- 
forward since it does not require a change in the glycosidic 
bond, which is syn in both cases. We observed a good 
correlation between the proportion of W-conformation 
and NER efficiency of FAAF. 

Moreover, although FAF and FAAF have similar S/ 
B-conformational profile (Figure 4), the N- acetyl group 
in the latter could act as a 'conformational locker' to 
raise the energy barriers among conformers. Such a 
scenario, i.e. higher energy barriers of FAAF vs. FAF, is 
plausible and might contribute to a greater disturbance in 
DNA, and thus greater repair. By contrast, the N- 
deacetylated FAF adopts a facile interchangeable B/ 
S-equilibrium (<2 kcal/mol) that triggers weaker binding 
affinities with the damage-recognition protein UvrA. A 
recent crystal study indicated that the UvrA dimer does 
not contact the lesion site directly, but rather binds DNA 
regions on both sides of the modification and primarily 
recognizes adduct-induced unwinding, bending and de- 
formity in the overall DNA structure (25). Furthermore, 
DNA damage recognition in E. coli NER is achieved 
through a sequential 2-step mechanism (46). The initial 
step is to recognize the adduct-induced distorted DNA 
structure. After strand opening at the damage site, the 
DNA adduct structure is further recognized or verified 
in a second step, which may facilitate the flipping of the 
adducted nucleotide (47,48). Therefore, it is possible that, 
for FAF, the second step of recognition plays a more 
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important role than the first step, whereas the first step is a 
dominate recognition for FAAF. 

The order of NER efficiencies described here is roughly 
consistent with bacterial NER data on AAF adducts 
embedded in a similar Narl sequence (17): Gi (100%), 
G 3 (66%) and G 2 (18%). Sequence dependence was also 
found in human NER of AAF adducted in the Narl 
sequence (18). In contrast to the E. coli NER data, 
however, the AAF adduct at G 2 (100%) was found to 
be more repairable, followed by G 3 (68%) and Gi 
(38%). Despite differences in the nature of proteins 
involved in prokaryotic and eukaryotic NER, the two 
systems show similar involvement of (3-hairpin intrusion 
as damage recognition factors (49). Liu et al. (50) found a 
general qualitative trend toward similar relative NER 
incision efficiencies for 65% of bulky benzo[a]pyrene 
and equine estrogen substrates. Similar to bacterial 
UvrA, Rad4 (XPC) in yeast also recognizes helical distor- 
tion to sense DNA damage; unlike bacteria, yeast use a 
base-flipping mechanism for repair (26). Therefore, the 
efficiency of repair depends not only on the damage rec- 
ognition step, but also on other factors, such as ease of 
base flipping. 

In summary, our structural and thermodynamic data 
provide valuable conformational insights into the 
sequence-dependent UvrABC incisions of the bulky 
FAF and FAAF adducts in the Narl sequence context. 
Repair of the bulky 7V-acetylated FAAF adduct seems to 
occur in a conformation-specific manner, i.e. the highly S/ 
W-conformeric G 3 and Gi duplexes incised considerably 
more efficiently than the G 2 duplex (G 3 ~ Gi > G 2 ) (Table 
2). These results were supported by melting and thermo- 
dynamic data. Not surprisingly, FAF was repaired 2- to 
3-fold less than FAAF; however, the order of incision 
efficiencies was the reverse of that in the FAAF case. 
We considered the so-called 7V-acetyl factor and 
lesion-specific recognition mechanism for the different 
orders of incision for FAF and FAAF. Finally, the tem- 
perature dependence of the S/B/W-conformational 
equilibria of the FAAF-adducts in the Narl sequence 
could provide valuable opportunities for 
conformation-specific NER utilizing thermophilic 
UvrABC proteins (51). Taken together, the results of 
this study demonstrate the complexity of NER mechan- 
isms of bulky DNA lesions. 

SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online: 
Supplementary Tables S1-S2, Supplementary Figures 
S1-S12. 

FUNDING 

National Institutes of Health (Grant number 
R01CA098296); RI-INBRE Research Core Facility sup- 
ported by the National Center for Research Resources (in 
part); National Institutes of Health (Grant number P20 
RR0 16457). Funding for open access charge: National 
Institutes of Health (Grant number R01CA098296). 



Conflict of interest statement. None declared. 



REFERENCES 

1. Luch,A. (2005) Nature and nurture - lessons from chemical 
carcinogenesis. Nat. Rev. Cancer, 5, 113-125. 

2. Melchior,W.B. Jr, Marques,M.M. and Beland,F.A. (1994) 
Mutations induced by aromatic amine DNA adducts in pBR322. 
Carcinogenesis, 15, 889-899. 

3. Neumann,H.G. (2007) Aromatic amines in experimental cancer 
research: tissue-specific effects, an old problem and new solutions. 
Crit. Rev. Toxicol, 37, 211-236. 

4. Friedberg,E.C, Walker,G.C, Siede,W., Wood,R.D., Schultz,R.A. 
and Ellenberger,T. (eds), (2006) DNA Repair and Mutagenesis, 
2nd edn. ASM Press, Washington. 

5. TruglioJ.J., Croteau,D.L., Van Houten,B. and Kisker,C. (2006) 
Prokaryotic nucleotide excision repair: the UvrABC system. 
Chem. Rev., 106, 233-252. 

6. Lehmann,A.R. (2003) DNA repair-deficient diseases, xeroderma 
pigmentosum, Cockayne syndrome and trichothiodystrophy. 
Biochimie, 85, 1101-1111. 

7. Van Houten,B. (1990) Nucleotide excision repair in Escherichia 
coli. Microbiol. Rev., 54, 18-51. 

8. Heflich,R.H. and Neft,R.E. (1994) Genetic toxicity of 
2-acetylaminofluorene, 2-aminofluorene and some of their 
metabolites and model metabolites. Mutat. Res., 318, 73-114. 

9. Beland,F.A. and Kadlubar,F.F. (1990) Handbook of Experimental 
Pharmacology. Spring, Heidelberg. 

10. Cho,B.P. (2004) Dynamic conformational heterogeneities of 
carcinogen-DNA adducts and their mutagenic relevance. J. 
Environ. Sci. Health C. Environ. Carcinog. Ecotoxicol. Rev., 22, 
57-90. 

11. Luo,C, Krishnasamy,R., Basu,A.K. and Zou,Y. (2000) 
Recognition and incision of site-specifically modified C8 guanine 
adducts formed by 2-aminofluorene, N-acetyl-2-aminofluorene and 

1- nitropyrene by UvrABC nuclease. Nucleic Acids Res., 28, 
3719-3724. 

12. Meneni,S.R., D'Mello,R., Norigian,G., Baker,G., Gao,L., 
Chiarelli,M.P. and Cho,B.P. (2006) Sequence effects of 
aminofluorene-modified DNA duplexes: thermodynamic and 
circular dichroism properties. Nucleic Acids Res., 34, 755-763. 

13. Patel,D.J., Mao,B., Gu,Z., Hingerty,B.E., Gorin,A., Basu,A.K. 
and Broyde,S. (1998) Nuclear magnetic resonance solution 
structures of covalent aromatic amine-DNA adducts and their 
mutagenic relevance. Chem. Res. Toxicol., 11, 391-407. 

14. Zhou,L., Rajabzadeh,M., Traficante,D.D. and Cho,B.P. (1997) 
Conformational heterogeneity of arylamine-modified DNA: 19F 
NMR evidence. /. Am. Chem. Soc, 119, 5384-5389. 

15. Patnaik,S. and Cho,B.P. (2010) Structures of 

2- acetylaminofluorene modified DNA revisited: insight into 
conformational heterogeneity. Chem. Res. Toxicol., 23, 1650-1652. 

16. Meneni,S., Shell,S.M., Zou,Y. and Cho,B.P. (2007) 
Conformation-specific recognition of carcinogen-DNA adduct in 
Escherichia coli nucleotide excision repair. Chem. Res. Toxicol., 
20, 6-10. 

17. Seeberg,E. and Fuchs,R.P. (1990) Acetylaminofluorene bound to 
different guanines of the sequence -GGCGCC- is excised with 
different efficiencies by the UvrABC excision nuclease in a 
pattern not correlated to the potency of mutation induction. 
Proc. Natl Acad. Sci. USA, 87, 191-194. 

18. Mu,D., Bertrand-Burggraf,E., HuangJ.C, Fuchs,R.P., Sancar,A. 
and Fuchs,B.P. (1994) Human and E.coli excinucleases are 
affected differently by the sequence context of 
acetylaminofluorene-guanine adduct. Nucleic Acids Res., 22, 
4869-4871. 

19. Burnouf,D., Koehl,P. and Fuchs,R.P. (1989) Single adduct 
mutagenesis: strong effect of the position of a single 
acetylaminofluorene adduct within a mutation hot spot. Proc. 
Natl Acad. Sci. USA, 86, 4147-4151. 

20. Jain,N., Li,Y., Zhang,L., Meneni,S.R. and Cho,B.P. (2007) 
Probing the sequence effects on Afarl-induced -2 frameshift 
mutagenesis by dynamic 19F NMR, UV, and CD spectroscopy. 
Biochemistry, 46, 13310-13321. 



Nucleic Acids Research, 2012, Vol. 40, No. 9 3951 



21. Broschard,T.H., Koffel-Schwartz,N. and Fuchs,R.P. (1999) 
Sequence-dependent modulation of frameshift mutagenesis at 
Atari-derived mutation hot spots. /. Mol. Biol, 288, 191-199. 

22. Koffel-Schwartz,N. and Fuchs,R.P. (1995) Sequence determinants 
for -2 frameshift mutagenesis at Afarl-derived hot spots. J. Mol. 
Biol, 252, 507-513. 

23. Mekhovich,0., Tang,M. and Romano, L.J. (1998) Rate of incision 
of N-acetyl-2-aminofluorene and N-2-aminofluorene adducts by 
UvrABC nuclease is adduct- and sequence-specific: comparison of 
the rates of UvrABC nuclease incision and protein-DNA complex 
formation. Biochemistry, 37, 571-579. 

24. Zou,Y., Shell,S.M., Utzat,C.D., Luo,C, Yang,Z., Geacintov,N.E. 
and Basu,A.K. (2003) Effects of DNA adduct structure and 
sequence context on strand opening of repair intermediates and 
incision by UvrABC nuclease. Biochemistry, 42, 12654-12661. 

25. Jaciuk,M., Nowak,E., Skowronek,K., Tanska,A. and Nowotny,M. 
(2011) Structure of UvrA nucleotide excision repair protein in 
complex with modified DNA. Nat. Struct. Mol. Biol, 18, 
191-197. 

26. MinJ.H. and Pavletich,N.P. (2007) Recognition of DNA damage 
by the Rad4 nucleotide excision repair protein. Nature, 449, 
570-575. 

27. Cho,B.P. and Zhou,L. (1999) Probing the conformational 
heterogeneity of the acetylaminofluorene-modified 
2'-deoxyguanosine and DNA by 19F NMR spectroscopy. 
Biochemistry, 38, 7572-7583. 

28. Gao,L., Zhang,L., Cho,B.P. and Chiarelli,M.P. (2008) Sequence 
verification of oligonucleotides containing multiple arylamine 
modifications by enzymatic digestion and liquid chromatography 
mass spectrometry (LC/MS). J. Am. Soc. Mass Spectrom., 19, 
1147-1155. 

29. Jain,N., Meneni,S., Jain,V. and Cho,B.P. (2009) Influence of 
flanking sequence context on the conformational flexibility of 
aminofluorene-modified dG adduct in dA mismatch DNA 
duplexes. Nucleic Acids Res., 37, 1628-1637. 

30. Meneni,S.R., Shell,S.M., Gao,L., Jurecka,P., Lee,W., SponerJ., 
Zou,Y., Chiarelli,M.P. and Cho,B.P. (2007) Spectroscopic and 
theoretical insights into sequence effects of aminofluorene-induced 
conformational heterogeneity and nucleotide excision repair. 
Biochemistry, 46, 11263-11278. 

31. Chakrabarti,M.C. and Schwarz,F.P. (1999) Thermal stability of 
PNA/DNA and DNA/DNA duplexes by differential scanning 
calorimetry. Nucleic Acids Res., 27, 4801^806. 

32. Zou,Y., Liu,T.M., Geacintov,N.E. and Van Houten,B. (1995) 
Interaction of the UvrABC nuclease system with a DNA duplex 
containing a single stereoisomer of dG-(+)- or dG-(-)-anti-BPDE. 
Biochemistry, 34, 13582-13593. 

33. Zou,Y. and Van Houten,B. (1999) Strand opening by the 
UvrA(2)B complex allows dynamic recognition of DNA damage. 
EMBO J., 18, 4889^901. 

34. Tan,X., Suzuki,N., Grollman,A.P. and Shibutani,S. (2002) 
Mutagenic events in Escherichia coli and mammalian cells 
generated in response to acetylaminofluorene-derived DNA 
adducts positioned in the Nar I restriction enzyme site. 
Biochemistry, 41, 14255-14262. 

35. Meneni,S., Liang,F. and Cho,B.P. (2007) Examination of the 
long-range effects of aminofluorene-induced conformational 
heterogeneity and its relevance to the mechanism of translesional 
DNA synthesis. /. Mol. Biol, 366, 1387-1400. 

36. Liang,F., Meneni,S. and Cho,B.P. (2006) Induced circular 
dichroism characteristics as conformational probes for 



carcinogenic aminofluorene-DNA adducts. Chem. Res. Toxicol., 
19, 1040-1043. 

37. Liang,F. and Cho,B.P. (2010) Enthalpy-entropy contribution to 
carcinogen-induced DNA conformational heterogeneity. 
Biochemistry, 49, 259-266. 

38. Elmquist,C.E., Wang,F., StoverJ.S., Stone,M.P. and Rizzo,C.J. 
(2007) Conformational differences of the C8-deoxyguanosine 
adduct of 2-amino-3-methylimidazo[4,5-f]quinoline (IQ) within the 
Narl recognition sequence. Chem. Res. Toxicol., 20, 445-454. 

39. Wang,F., DeMuro,N.E., Elmquist,C.E., StoverJ.S., Rizzo,C.J. 
and Stone,M.P. (2006) Base-displaced intercalated structure of the 
food mutagen 2-amino-3-methylimidazo[4,5-f]quinoline in the 
recognition sequence of the Narl restriction enzyme, a hotspot for 
-2 bp deletions. J. Am. Chem. Soc, 128, 10085-10095. 

40. Wang,F., Elmquist,C.E., StoverJ.S., Rizzo,C.J. and Stone,M.P. 
(2007) DNA sequence modulates the conformation of the food 
mutagen 2-amino-3-methylimidazo[4,5-f]quinoline in the 
recognition sequence of the Narl restriction enzyme. Biochemistry, 
46, 8498-8516. 

41. Brown,K., Hingerty,B.E., Guenther,E.A., Krishnan,V.V., 
Broyde,S., Turteltaub,K.W. and Cosman,M. (2001) Solution 
structure of the 2-amino-l- methyl-6-phenylimidazo[4,5-b]pyridine 
C8-deoxyguanosine adduct in duplex DNA. Proc. Natl Acad. Sci. 
USA, 98, 8507-8512. 

42. Koehl,P., Valladier,P., LefevreJ.F. and Fuchs,R.P. (1989) Strong 
structural effect of the position of a single acetylaminofluorene 
adduct within a mutation hot spot. Nucleic Acids Res., 17, 
9531-9541. 

43. Veaute,X. and Fuchs,R.P. (1991) Polymorphism in N-2- 
acetylaminofluorene induced DNA structure as revealed by 
DNase I footprinting. Nucleic Acids Res., 19, 5603-5606. 

44. Gillet,L.C, Alzeer J. and Scharer,O.D. (2005) Site-specific 
incorporation of N-(deoxyguanosin-8-yl)-2-acetylaminofluorene 
(dG-AAF) into oligonucleotides using modified 'ultra-mild' DNA 
synthesis. Nucleic Acids Res., 33, 1961-1969. 

45. Shapiro,R., Hingerty,B.E. and Broyde,S. (1989) Minor-groove 
binding models for acetylaminofluorene modified DNA. 

J. Biomol. Struct. Dyn., 7, 493-513. 

46. Zou,Y., Luo,C. and Geacintov,N.E. (2001) Hierarchy of DNA 
damage recognition in Escherichia coli nucleotide excision repair. 
Biochemistry, 40, 2923-2931. 

47. Malta,E., Verhagen,C.P., Moolenaar,G.F., Filippov,D.V., van der 
Marel,G.A. and Goosen,N. (2008) Functions of base flipping in 
E. coli nucleotide excision repair. DNA Repair, 7, 1647-1658. 

48. Malta,E., Moolenaar,G.F. and Goosen,N. (2006) Base flipping in 
nucleotide excision repair. J. Biol. Chem., 281, 2184-2194. 

49. Scharer,O.D. (2011) Multistep damage recognition, pathway 
coordination and connections to transcription, damage signaling, 
chromatin structure, cancer and aging: current perspectives on the 
nucleotide excision repair pathway. DNA Repair, 10, 667. 

50. Liu,Y., Reeves,D., Kropachev,K., Cai,Y., Ding,S., 
Kolbanovskiy,M., Kolbanovskiy,A., BoltonJ.L., Broyde,S., Van 
Houten,B. et al. (2011) Probing for DNA damage with 
beta-hairpins: similarities in incision efficiencies of bulky DNA 
adducts by prokaryotic and human nucleotide excision repair 
systems in vitro. DNA Repair, 10, 684-696. 

51. Ruan,Q., Liu,T., Kolbanovskiy,A., Liu,Y., Ren J., Skorvaga,M., 
Zou,Y., LaderJ., Malkani,B., Amin,S. et al. (2007) Sequence 
context- and temperature-dependent nucleotide excision repair of 
a benzo[a]pyrene diol epoxide-guanine DNA adduct catalyzed by 
thermophilic UvrABC proteins. Biochemistry, 46, 7006-7015. 



